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a more interoperable LOCKSS 


•beyond e-resources 

* a solution for 
preserving the digital 
content your 
community cares about 

•APIs + interoperability 

* maximize impact by 

enabling integration + 
interconnection 

* improve sustainability 

by leveraging 
standardized 
community solutions 





lots of copies keep stuff safe 





Global LOCKSS Network 


•subscription e-resources 

• 150+ institutions 

•each runs local LOCKSS node 

• peer-to-peer preservation 

• publishers opt-in 

• post-cancellation access 






lots of LOCKSS 


• LOCKSS (principle) 

• LOCKSS (program) 

• LOCKSS (software) 
•Global LOCKSS Network 

• Private LOCKSS Networks 

•Controlled LOCKSS 
(CLOCKSS) 
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Private LOCKSS Networks (PLNs) 


• community of interest 
•jointly designate content 
•run distributed nodes 



•establish governance 

• preservation via diverse: 

• technologies 

• institutions 

• networks 


PKP 


PUBLIC 

KNOWLEDGE 

PROJECT 
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Meta-Archive 

COOPERATIVE 


SAFE PLN 

SAFE Archiving FEderation 
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COPPUL 


Council of Prairie and 
Pacific University Libraries 



ASS 


DATA PRESERVATION ALLIANCE for the 


SOCIAL SCIENCE' 


The Alabama Digital Preservation Network 

Preserving Alabama's digital resources 


Carinicma 

-* Rede Brasileira de Seruiqos de 
Preseruaqao Digital 
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integration opportunities 


• polling + repair 

• repository replication 

• other distributed digital 
preservation systems 

•access 

• Dockerized Solr 
indexing for WARC'ed 
content 

• DOI + OpenURL access 
to web archives 

•metadata extraction 





why re-architect LOCKSS? 


• reduce support + operations costs 

• leverage web-scale open-source software 

• align w/ web archiving mainstream 

•de-silo components + enable external integration 

• metadata extraction 

• archive access via DOI + OpenURL 

• polling + repair protocol 

• prepare to evolve w/ the Web 

• web services architecture as flexible foundation 


leveraging community components 


LOCKSS harvester 


headless browser 


archival crawler 
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other 

LOCKSS 

boxes 


access 

control 


LOCKSS poller 
service 


web archive 
replay 
platform 
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LOCKSS 
metadata 
query service 


indexing 

service 



full-text 

index 


data processing environment 

















aligning with web archiving 


Web ARChive (WARC) 
format 



compatible technologies 

• Heritrix 

• OpenWayback 

• WarcBase 

•Web Archiving Proxy 




API candidates 


•capture tool/proxy 
interconnect 

•capture tool 
management 

•data import/export 

•query + extraction 
• integrity audit + repair 
•descriptive metadata 


• logs + analytics 

• renderings/derivative 
formats 

•federated data delivery 

•federated replay 

•federated full-text 
search 
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web archiving system APIs (WASAPI) 


National Digital Platform Projects funded in August 2015 

Systems Interoperability and Collaborative Development for Web Archiving 

(LG -71-15-0174-15' ; The Internet Archive, working with partner organizations 
University of North Texas, Rutgers University, and Stanford University Library 
will undertake a two-year research project to explore techniques that can 
expand national web archiving capacity in several areas. 
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development roadmap 

•2017 

• Docker-ize components 

• web harvest framework 

• polling + repair web 
service 

•2018 

• IP address + Shibboleth 
access via OpenWayback 

• OpenWayback format 
negotiation framework 

• full-text search web 
service 
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